rank | frequency | n-gram |
---|---|---|
1 | 13541 | -s |
2 | 13426 | -e |
3 | 6008 | -t |
4 | 3751 | -r |
5 | 3680 | -n |
rank | frequency | n-gram |
---|---|---|
1 | 6248 | -es |
2 | 3760 | -nt |
3 | 2433 | -on |
4 | 2360 | -er |
5 | 1827 | -re |
rank | frequency | n-gram |
---|---|---|
1 | 2505 | -ent |
2 | 1862 | -ion |
3 | 1193 | -ons |
4 | 1016 | -ant |
5 | 932 | -que |
rank | frequency | n-gram |
---|---|---|
1 | 1563 | -tion |
2 | 1160 | -ment |
3 | 768 | -ique |
4 | 675 | -ions |
5 | 526 | -ques |
rank | frequency | n-gram |
---|---|---|
1 | 1091 | -ation |
2 | 979 | -ement |
3 | 515 | -tions |
4 | 438 | -iques |
5 | 258 | -tique |
The tables show the most frequent letter-N-grams at the ending of words for N=1…5. Everything runs in parallel to 2.2.5 Most frequent word beginnings. The aim is suffix detection instead of affix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.* from (SELECT @pos:=0) r, (select count(*) as cnt ,concat("-", right(word,3)) FROM words WHERE w_id>100 group by right(word,3) order by cnt desc) xx limit 5;
2.2.5 Most frequent word beginnings